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Abstract. - The dynamic environment in the real world calls for the adaptive techniques for 
information filtering, namely to provide real-time responses to the changes of system data. Where 
many incremental algorithms are designed for this purpose, they are usually challenged by the 
worse and worse performance resulted from the cumulative errors over time. In this Letter, we 
propose two incremental diffusion-based algorithms for the personalized recommendations, which 
integrate some pieces of local and fast updatings to achieve the approximate results. In addition 
to the fast responses, the errors of the proposed algorithms do not cumulate over time, that is to 
say, the global recomputing is unnecessary. This remarkable advantage is demonstrated by several 
metrics on algorithmic accuracy for two movie recommender systems and a social bookmarking 
system. 
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Introduction. — The information overloading prob- 
lem in the Internet era causes many automatic filtering 
techniques to appear [1], among which the personalized 
recommender system is a promising one [2-4]. In the lit- 
erature, the central problem concerned in the design of rec- 
ommender systems is how to improve the accuracy [5-7] 
and diversity [8-10] of recommendations. Recently, the 
well-developed Web 2.0 technique facilitates users to fre- 
quently communicate with the system in an easy way, lead- 
ing to a real-time data flow. How to properly design an 
incremental algorithm that can adaptively give responses 
to the changes of data has thus become an urgent prob- 
lem nowadays. Where some previous studies have already 
considered such an issue [11-17], they arc challenged by 
the worse and worse performance resulted from the cumu- 
lative errors over time. 

Recently, Zhang et al. [18, 19] have successfully applied 
the classical physical processes, such as the heat conduc- 
tion and mass diffusion, to deal with the personalized rec- 
ommendation problem. The original algorithms require 
a kind of steady states, and to arrive at these states is 
time consuming. Zhou et al. [10, 20] thus proposed the 
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simplified versions where only one step of heat conduc- 
tion and/or mass diffusion is taken into account. These 
simplified algorithms are considerably more accurate than 
the standard collaborative filtering [21] and much faster 
yet with competitive accuracy compared with the matrix 
decomposition techniques [22]. In this Letter, we pro- 
pose two adaptive algorithms based on the one-step dif- 
fusion methods, which integrate some pieces of local up- 
datings and can provide very fast response to each unit 
change. More significantly, the error between the re- 
sults of the adaptive algorithms and the static algorithms 
docs not grow larger over time, namely the adaptive al- 
gorithms could keep up with the changes tightly and thus 
no global recomputing is needed. This remarkable feature 
highlights a new methodology to handle the huge-size dy- 
namic systems, and can avoid the serious problems in the 
maintaining-rctraining scheme [14], like the data synchro- 
nization. 

Static algorithms. — A recommender system can be 
represented by a bipartite network [23], which consists of 
a user set U and an item set J. To make the description 
clear, we use lower case letters i, j, k, ... to index the users 
and Greek letters a, (3, 7, . . . to index the items. Each user 
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is connected to the items she/he collected, reviewed, voted 
with high ratings, purchased, etc. Later, we simply use 
collect / collected to stand for all possible relations between 
users and items. The bipartite network can be described 
by an adjacency matrix A — {ai a }, where a,i a = 1 if an 
edge between user i and item a exists, ai a = otherwise. 

A diffusion-based recommendation algorithm applies 
some physical processes to simulate the resource propa- 
gation in the bipartite network. For an arbitrary target 
user, assigning some resources to her/his collected items, 
whether she/he user will collect certain items in the future 
can be predicted according to the resource distribution af- 
ter the diffusion process [20] . Generally speaking, this kind 
of methods can be formulated as /' = Wf, where / is 
the vector representing the initial resource allocation, /' 
is the final resource distribution and W is the propagation 
matrix. Provided the target user i, a straightforward way, 
also adopted in this Letter, is to set the initial vector as 
fa = o-iai namely to assign all the objects collected by user 
i a unit of rcsourc^E The element w a p denotes the frac- 
tion of item /3's initial resource that eventually arrives at 
item a. Sorting all uncollected items by /' in descending 
order, the ones at the top positions are recommended. 

We here consider two simplest diffusion-based algo- 
rithms: the one-step mass diffusion (MD) [20] and heat 
conduction (HC) [100. The propagation matrix of MD is 
denoted by M with 
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Fig. 1: (Color online) Illustration of the diffusion processes by 
MD and HC. (a) and (b) display how a unit of resource initially 
located on item j3 spreads to item a by MD, while (c) and (d) 
show how a unit of resource (a physical explanation of this 
kind of resource is "temperature", see Refs. [10,18] for details) 
initially located on item a spreads to item f3 according to HC. 
As shown in this figure, m a p — hp a = 1/4. 



where ki and hp are the degrees of user i and item /3, 
respectively. Analogously, the propagation matrix H of 
HC reads 

EQ-ia a if3 
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Figure [T] illustrates the processes of MD and HC. Inter- 
estingly, the MD and HC are a pair of reverse processes, 
say M T — H . A direct benefit of this feature is that one 
only needs to compute one propagation matrix and then 
immediately gets the other one by transposition, which 
can reduce the computation when considering the hybrid 
algorithm involving both the two processes [10]. In this 
Letter, we will exploit this feature in a more artful way to 
develop an effective and efficient adaptive algorithm. 

Exact solution on incremental data. Although 
the ability to provide accuracy and diverse recommenda- 
tions of the MD and HC, especially the hybrid version 
combined these two, has been demonstrated by extensively 



1 The initial resource allocation can be designed in more sophisti- 
cated ways (see for example the Ref. [8]), but as shown in the later, 
the initial conditions have no business with the adaptive algorithms, 
and thus we do not intend to discuss it in this Letter. 

2 In Ref. [10], these two algorithms are referred as ProbS and 
HeatS. Since we here deal with the bipartite networks, the diffusion 
from items to items actually contains two steps. 



experimental analysis [10], they are not able to work di- 
rectly in a dynamic environment where the bipartite net- 
work (represented by A) itself varies moment to moment. 
A unit change here is defined as the addition of a new edge, 
whatever it is attached to a new user, a new item, or old 
ones. Accordingly, the total number of edges, I, is natu- 
rally a good indication of the state of the evolving system, 
which is added to all relevant symbols as the superscript 
if necessary. For example, k^ means the degree of the 
item a when there are I edges in the network. A cautious 
yet infeasible method is to recompute every time when a 
new edge is added to the system. Instead, the purpose of 
an adaptive algorithm (or say an incremental algorithm) 
is to derive the recommendation vector in a local 

and fast way that can avoid the global recomputing. 

Before the introduction of the approximately adaptive 
algorithm, we first figure out the exact solution of /' 
for an arbitrary target user. As stated above, the propa- 



3 According to the time resolution of the system, it is very possible 
that several edges are added at the same time. In such a case, we 
always assume that these edges are added one by one in a random 
order. In addition, a unit change may not be the addition of an 
edge, but a removal of an edge like in del.icio.us users are free to 
remove their collections. As shown later, our algorithms can handle 
the removal of edges in the similar way to the addition of edges, and 
thus we here only discuss how to get /' 
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gation matrices of MD and HC can be easily derived by 
each other, so we only present the case for MD, while the 
one for HC can be analogously obtained without any diffi- 
culties. Since for any target user, = M( i+1 )/ (i+1 ), 
and the change of / is straightforward, we only focus on 
the change of M. 
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Fig. 2: (Color online) Illustration of four types of changes to 
the M with the addition of a new edge. The dotted line is 
the new edge added between user i and item a. The lines with 
arrow are the diffusion paths affected by the new edge. And 
the dotted circles emphasize the nodes (users or items) being 
responsible for the corresponding type of changes. 

With the addition of a new edge between user i and 
item a (we first consider the case that both i and a already 
exist in the system) , the propagation matrix changes 
to be M" +1 '. In the microscopic level, the elements of 
iW^ +1 ) can be classified into four types according to the 
way how they change. As illustrated in Fig. 2, the four 
types are: Type I — m^ 1 ^ where item (3 is connected to 

user i (f3 ^ a). Type II — m^ 1 ^ where item a and 7 are 
commonly collected by one or more users rather than the 
user i. Type III — w^ 1 ^ where item (3 is connected to user 

i {(3 7^ a). Type IV — m ^p^ where the item (3 and \x are 
both connected to the user % {(3 ^ a, p^a). 

Denoting by the change of the element mp a , say 

M+l) (.1 + 1) (l) , t^. r, • . 

0r„. = m„„, — m\' a , according to fig. 2, it is easy to 
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derive the following updating rules: 
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Note that, the updating rules shown in Eqs. (3)- (6) do 
not change when the addition of the edge introduces a new 
user of a new item, and only a few extra manipulations 
are required. When introducing a new user i, only type II 
happens. When a new item a is introduced, a new row and 
a new column need to be appended to M and initialized 
to zero, which actually are the initial values of nig^ and 



Table 1: Fundamental statistics of the applied data sets. The 
data sets for Netflix and Delicious are random samples. 



Data Set 


\u\ 


\I\ 


Average Object Degree 


MovieLens 


943 


1574 


52.43 


Netflix 


609 


8824 


11.33 


Delicious 


1623 


32941 


3.77 



m a/3^- I n t ms case the type II won't happen but type I, 
type III and type IV should be considered to update the 
expanded propagation matrix. Finally, when both a new 
user and a new item are added, no change will be made 
to the existed elements in . The only operation is to 
append a new row and a column to and to initialize 

them to zero. 

We have already numerically checked that the updating 
rules in Eqs. (3)-(6) can exactly reproduce the results of 
static algorithm. It seems that there is no need to design 
another approximated algorithm since all the the changed 
elements can be handled by Eqs. (3)- (6). Unfortunately, 
this exact incremental algorithm is facing a dilemma be- 
tween space and timcQ, and thus far away from the real- 
world application. In spite of that, the Eqs. (3)- (6) pro- 
vide a valuable information for analyzing the errors of the 
adaptive algorithms that will be discussed in the next sec- 
tion. 

Adaptive algorithms. — The compressed storage 
(only the nonzero elements are stored) of the adjacency 
matrix makes it difficult to be randomly accessed, how- 
ever, calculating the propagation from a certain item to 
the others, whether by mass diffusion or heat conduction, 
is very efficient in such infrastructure, because it can ac- 
cess the adjacency matrix (or equivalent data structure 
representing the connections between users and items) in 
a sequential wajH. Hence, for the elements belonging to 



4 The exact incremental algorithm, as described by Eqs. (3)-(6), 
actually consists of two parts. Firstly we should identify the ele- 
ments that need to be updated, and then to update them. The 
identification can be further divided into two steps: we should first 
to find out the elements in each type (here we obtain the subscripts, 
such as we know the element m X y belongs to Type IV), and then 
to find out the locations of these elements in the memory space (for 
example, we, or the computer programme, have to know the loca- 
tion of M xy before updating it). According to any known techniques 
of compressing storage for sparse matrix, the identification process 
takes very long time, usually even much longer than the updating 
(to locate a specific element in a compressed stored data structure is 
not a trivial task, even if we can insert a kind of order among these 
elements, it requires O(logS') time where 5 denotes the number of 
relevant elements). On the hand, if all the elements of M and A are 
stored in two two-dimensional arrays, the identification process is 
very quick and thus the whole updating process can be completed in 
a reasonable time. However, it asks for 0(|7"| 2 + |/| X \U\) memory, 
and thus fails to handle the huge-size systems that usually contain 
millions to billions of users and/or items. 

5 For example, we can simultaneously save all the neighboring 
items of each user and all the neighboring users of each item by dy- 
namical linked lists, and thus the propagation process will sequential 
access the elements in a list. 
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Type I and Type II, instead of identifying and updating, 
one can just redo a mass diffusion starting with the item 
a to get the updated values. We call this algorithm the 
adaptive algorithm with first-order approximation, abbre- 
viated as AAF. 
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Fig. 3: AUC, Precision and Recall as the function of the num- 
ber of edges fed to the algorithms on MovieLens data. The 
algorithm starts with 5000 edges. Precision and Recall are 
evaluated with K = 100, 300 and 500, respectively. For all 
numerical results, including the ones shown in Fig. 4 and Fig. 
5, the solid lines represent the results of static algorithm, and 
the diamonds and squares correspond to AAF and AAS. 

In AAF, when an edge is added, only the elements be- 
longing to the first and the second types are updated by 
using the MD starting from the item a. The high effi- 
ciency is guaranteed, whereas the error is introduced for 
the neglect of other changes of Type III and Type IV. De- 



tailedly, an error of ( kn^k^ 



vg n, i j (see Eq. (5)) occurs 
when an element in the third type is not updated, and an 
error of (jc^ +1 ^k^ (see Eq. (6)) occurs when an 

element in the fourth type is not updated. Obviously, the 
latter error is far less than the former one since in a real 
system, many users have collected more than a hundred 
items. Therefore, if the error of the elements in the third 



type can be eliminated, the accuracy of the approximate 
algorithm will be improved significantly. 

The straightforward way to update the elements in the 
third type will add too much computation burden, because 
it needs to simulate the MD process from all items sharing 
at least one common neighboring user with the item a to 
the item a (see Eq. (5)). Thanks to the reversibility found 
for the MD and HC processes, we conquer this difficulty 
skillfully. Instead of MD, an HC process starting from the 
item a is calculated in A" +1 \ then the resource arriving 
at item j3 is Tn^t . By this way, the updating of all 
the elements in the third type is completed by HC once 
from one item, with the same complexity as in AAF. We 
name this algorithm as the adaptive algorithm with second- 
order approximation, abbreviated as AAS. In AAS, when 
an edge to the item a is added, an MD from the item a 
is done to get the updated elements in the first and the 
second types, then an HC from the item a is done to get 
the updated elements in the third type. 
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Fig. 4: AUC, Precision and Recall as the function of the num- 
ber of edges fed to the algorithms on Netflix data. The algo- 
rithm starts with 5000 edges. Precision and Recall are evalu- 
ated with K — 100, 300 and 500, respectively. 
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Experimental results. — To validate that the pro- 
posed adaptive algorithms could provide good approxi- 
mation for the static algorithm, we run numerical simu- 
lations on three real systems: MovieLen^, and 
Deliciou^. The characteristics of these data sets arc sum- 
marized in Table [TJ The original data of MovieLens and 
Netflix record the opinions of users on items with the rat- 
ings from 1 (the worst) to 5 (the best). We convert these 
two rating systems to binary systems by defining a connec- 
tion between a user and an item only if the corresponding 
rating is higher than 2. 

A standard sub-sampling validation method is adopted 
to evaluate the algorithmic performance, which divides all 
edges randomly into two subsets: the training set and the 
testing set. The training set is used as the known infor- 
mation for recommending, while the testing set is used 
to evaluate the prediction and is invisible to the recom- 
mendation algorithms. In this Letter, 10% of edges con- 
stitute the testing set and the other 90% belong to the 
training set. Three widely measurements, AUC, Precision 
and Recall, are applied to quantify the accuracy of rec- 
ommendations (see the review article [5] for details). All 
of them are essentially individual-based indices, that is to 
say, for any user i, the individual measurements AUCf , 
Precision^ and Recallf exist. The system-level indices 
are simply obtained by averaging the individual measure- 
ments over all users. Denoting by T± the set of items 
in the testing set that are collected by user i (they are 
invisible from the training set but favored by user z), and 
= j(0 _ rf ) - T^ l) the set of items not favored yet 
by user i, where T^p is the set of items collected by user 
i in the training set, then AUCf^ is defined as the prob- 
ability a randomly selected item in is assigned a top- 
per position in the recommendation list than a randomly 
selected item in . Clearly, random recommendations 
correspond to AUG « 0.5. Given the length of recommen- 
dation list, K, the Precision^ is defined as the ratio of 
the number of the recommended items belonging to to 
the total of recommended items, K. The Recall^ is the 
ratio of the number of the recommended items belonging 
to Tp to \TP\. Note that, AUC value does not depend 
on K, while the Precision and Recall are if-sensitive. For 
all these three indices, the larger value corresponds to the 
higher recommendation accuracy. 

To imitate the dynamic environment in the real world, 
we sort the edges in the training set by the time when they 
appeared, then feed them to the algorithm one by one. For 
each 5000 edges, we compute the AUC, Precision and Re- 
call. The performance along with the adding of edges is 
shown in Figs. 3-5. As the addition of edges, the known 
information increases and thus the AUC value increases 
too. In contrast, the Precision and Recall exhibit larger 

6 http:/ /www. grouplens.org/. 
7 http:/ /www. netflixprize.com/. 
8 http:/ /delicious. com/. 



irregularities since they concentrate on a tiny fraction of 
items (i.e., top-recommended items). For both the three 
indices, the adaptive algorithms perform well, even when 
the Precision and Recall of the static algorithm display- 
ing irregularities, the approximated algorithms accurately 
reproduce these irregularities. In addition, it is seen that 
the AAS performs better than the AAF especially for the 
sparser data sets, Netflix and Delicious. 
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Fig. 5: AUC, Precision and Recall as the function of the num- 
ber of edges fed to the algorithms on Delicious data. The 
algorithm starts with 5000 edges. Precision and Recall are 
evaluated with K — 100, 300 and 500, respectively. 

The most significant feature of the adaptive algorithms, 
AAF and AAS, is that they could follow the static algo- 
rithm tightly. For MovieLens and Netflix data, almost 
no difference between the adaptive algorithms and the 
static algorithm is observed, except a small departure of 
the AUC for the Netflix data in the start. Even for the 
Delicious data where there are observable errors of the 
approximations, the difference does not grow larger over 
time. This feature strongly encourages the usage of AAS 
to completely replace the costly static algorithm. It is 
worthwhile to emphasize that the adaptive algorithms will 
give more closer results to the static algorithm for denser 
data, since the error magnitude is negatively correlated 
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with the item degree and user degree (for AAF, see Eq. 
(5) and Eq. (6); for AAS, see Eq. (6)). One may doubt 
that although the error caused by the fourth type is small, 
its accumulative effects may be considerable and make the 
approximation farther and farther to the static algorithm. 
Fortunately, this will never happen in real systems be- 
cause when a new edge adjacent to an item a is added, 
all the errors of the elements of M relevant to a (i.e., m a . 
and m. a ) will be eliminated by AAS. Therefore, the er- 
rors relevant to active items will not cumulate, while the 
items rarely attaching new connections must be less at- 
tractive or out of date, which contribute very little to the 
recommendations . 

Conclusion and Discussion. — Although new data 
comes every moment, many information filtering systems 
still adopt the traditionally algorithmic framework, which 
periodically recompute on the whole data. Under the dy- 
namic environment of data flow, an urgent problem is how 
to design effective and efficient adaptive algorithms to give 
real-time responses to the change of data. A well-designed 
algorithm is supposed to not only provide fast and accu- 
rate responses, but also avoid the cumulative increase of 
errors, so that the global recomputation is unnecessary. 
However, for a general static recommendation algorithm, 
there is no guarantee of the existence of such an adaptive 
algorithm. In this Letter, being aware of the reversibil- 
ity of mass diffusion and heat conduction, we propose a 
fast and accurate adaptive algorithm with second-order 
approximation, which could follow the static algorithm 
very tightly and avoid the cumulative growth of errors. 

Although this Letter only presented the adaptive algo- 
rithms for mass diffusion process, the heat conduction pro- 
cess can be handled in an analogous way. Moreover, the 
linear integration of mass diffusion and heat conduction is 
shown to be able to provide much more accurate and di- 
verse recommendations than either single algorithm [10]. 
The proposed AAS can be straightforwardly extended to 
deal with this linear hybrid case. The present method can 
also find its applications for more complicated diffusion- 
based recommendation algorithms, for example, the diffu- 
sions on multi-rating systems [19,24] or hypcrgraphs in- 
volving users, items and tags [25,26]. 
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